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FINAL  REPORT  SUMMARY 


We  have  analysed,120  population  samples  of  ^edes  aegypti  for  genetic 


variation  at  up  to  22  protein-coding  loci.  These  samples  originated  from 
75  localities  throughout  the  world-wide  distribution  of  the  species.  The 
most  important  finding  was  that  the  species  could  be  divided  into  7  groups 
based  on  geographic  and  subspecific  criteria.  These  7  groups  are  West  Africa, 
East  Africa  subspecies  formosus ,  Asia,  Southeast  United  States,  Texas/Mexico, 
South/CentralAmerica,  Caribbean,  and  East  Africa  subspecies  aegypti .  These 


groups  are  sufficiently  distinct  to  allow  assignment  of  a  population  of  unknown 
origin  to  one  of  the  groups  with  a  90%  or  greater  probability  of  correct  assign- 


Some  of  these  samples  have  also  been  stud^^  for  their  ability  to  transmit 
yellow  fever  virus.  Considerable  variation  for  this  trait  was  also  found. 
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The  main  purpose  of  this  contract  was  to  analyse  populations  of  Aedes 


aegypt i  for  genetic  variability  throughout  its  distribution.  The  species 
is  very  widespread  throughout  the  tropical  and  subtropical  zones  around 
the  world.  Thus,  there  is  a  lot  of  territory  to  cover. 

We  have  obtained  about  120  samples  from  75  different  localities^  from 
some  localities  more  than  one  sample  were  obtained.  These  samples  represent 
as  well  as  possible  the  world-wide  distribution  of  the  species.  Generally 
a  sample  consisted  of  50  to  300  individuals. 

Genetic  variation  was  determined  at  up  to  22  enzyme-coding  loci  by 
the  method  of  gel  electrophoresis.  Individuals  collected  in  nature  were 
either  electrophorejgJ  directly  or  F!  offspring  were  used  in  instances  where 
too  few  individuals  from  nature  were  obtained.  Thus  we  avoided,  as  much  as 
possible,  artifacts  inherent  in  long-term  laboratory  rearing.  There  were 
about  50  alleles  at  the  22  loci  studied.  The  data  consist  of  allele  frequencies 
at  each  locus  for  each  sample.  Thus  the  total  data  set  is  about  6,000  allele 
f requeue ies . 

In  order  to  make  this  large  amount  of  data  comprehensible,  we  have  used 
various  analytical  procedures,  some  of  which  are  fairly  standard  for  population 
genetic  analysis,  while  others  were  developed  by  us  to  specifically  answer 
quest  long  pertinent  to  Aedes  aegypt i . 

While  not  directly  supported  by  this  contract,  some  samples  obtained  under 
it  were  used  in  another  study  of  the  genetic  basis  of  yellow  fever  transmission. 
The  attached  preprint  of  a  paper  from  a  symposium  summarizes  this  work. 

All  the  work  supported  by  this  contract  has  been  published.  Reprints  have 
been  supplied  in  previous  reports.  Appended  here  is  a  complete  list  of  publi¬ 
cations  which  eminated  from  work  supported  by  this  contract. 


I960  Powell,  J.R.,  W.  Tabachnlck  and  J.  Arnold.  I98O.  Genetics  and  the  origin 
of  a  vector  populations:  Aedes  aegyptl,.  a  case  study.  Science 
208:1385-1387. 

1962  Powell,  J.R.,  W.  Tabachnlck  and  G.P.  Wallis.  1982.  Aedes  aegvptl  as  a 
model  of  the  usefulness  of  population  genetics  of  vectors.  In: 

Recent  Developments  la  the  Genetics  Insect  Disease  XafttorJ,  A 
Symposium  Proceedings,  pp.  396-^112,  edited  by  W.W.M.  Steiner  oi. , 
Stipes  Publishing  Co. ,  Champaign,  XL. 

Tabachnlck,  W. ,  T.H.G,  Aitken,  B.J.  Beaty,  B.R.  Miller,  J.R.  Powell  and 
G.P.  Wallis.  1982.  Genetic  approaches  to  the  study  of  vector 
competency  In  Aedes  aegyptl.  In:  Recent  la  HiS.  G^ng-tlca 

Insect  Disease  Vectors,  a  Symposium  Proceeding,  pp.  1113-432,  edited  by 
W.W.M.  Steiner  il. ,  Stipes  Publishing  Co. ,  Champaign,  IL. 

Tabachnlck,  W.J.  and  D.J.  Howard.  1982.  Genetic  control  of  lexoklnase 
variation  In  Insects.  Biochemical  Genetics  20:47-57. 

Wallis,  G.P.  and  W.J.  Tabachnlck.  1982.  Linkage  of  isocltrate 

dehydrogenase  with  sex  and  two  lethals  in  Aedes  aegypti.  Jggraal  ■&£ 
Heredity  73:291-294. 

1983  Wallis,  G.P.,  W.J.  Tabachnlck,  and  J.R.  Powell.  Macrogeographic  genetic 

variation  In  a  human  commensal:  Aedes  aegvptl.  the  yellow  fever 
mosquito,  Genetlcal  Research  41:241-258. 

1984  Wallis,  G.P.,  W.J.  Tabachnlck,  and  J.R.  Powell.  Genetic  heterogeneity 

among  Caribbean  populations  of  Aedes  aegyptl.  Jpumal  ££  IroPlcal 
Medicine  ood  Hygelne  33 : 492-498 . 


IN  PRESS 

Powell,  J.R.  Geographic  genetic  differentiation  and  arbovirus  competency: 
Aedes  aegyptl  and  yellow  fever.  Proceedings  of  a  symposium,  Gqn^tlC 
Approaches  la  Uhs  Study  of  Parasites  ood  Pl,g.taga  Yg.C.tgXa  edited  by  M. 
Coluzzi,  Rome,  December  1983. 


Geographic  Genetic  Differentiation  and  Arbovirus  Competency 
Aedea  aegvnti  and  Yellow  Fever 


Jeffrey  R.  Powell 
Department  of  Biology 
Tale  Dniveralty 
New  Haven,  CT  06511  USA 


In  diploid,  sexually  reproducing  organiaas,  no  two  Individuals  are  genetically 
Identical  (with  the  possible  exception  of  aonocygotlc  twins).  Thus,  while  It  is 
possible  to  define  a  species  as  a  coherent  unit,  one  aust  always  reaeaber  such  units 
are  coaposed  of  populations  of  diverse  Individuals.  In  parasitology  when  one 
designates  a  species  as  the  pathogen,  vector,  or  host  It  aust  be  borne  In  aind  that 
considerable  variation  aaong  Individuals  exists  In  each.  Soae  of  this  variation  aay 
have  considerable  relevance  with  respect  to  disease  transalsslon.  One  need  only 
aentlon  drug  resistance  In  Plasaodlua  and  sickle-cell  anemia  In  humans  to  underscore 
this  point. 

Several  years  ago  we  embarked  on  a  population  genetic  analysis  of  aegvptl. 

the  major  vector  of  urban  yellow  fever.  Our  Initial  Interest  In  the  species  was  due 
primarily  to  Its  Interesting  evolutionary  history.  It  has  long  been  recognized  as  a 
polytypic  species,  l.e.  has  genetic  variation  for  morphological  traits.  Further 
studies  revealed  genetic  variation  for  a  number  of  physiological  and  behavioral 
traits  as  well.  Ecological  studies,  especially  in  East  Africa,  identified  what  Is 
the  probable  ancestral  sylvan  form  which  gave  rise  to  the  domestic  form  which  in  turn 
spread  around  the  world  as  a  human  commensal.  Because  the  most  widespread  form  Is 
associated  almost  exclusively  with  humans,  educated  guesses  can  be  made  as  to  routes 
and  times  of  Introduction  and  levels  of  gene  flow  aaong  populations.  These  factors, 
plus  the  fact  It  Is  quite  easy  to  rear  In  the  laboratory,  made  Aedes  aegvptl  an 
attractive  species  for  population  genetic  studies. 

Colncldently,  there  Is  a  research  group  at  Tale  University  interested  in  genetic 
variation  In  Insect  vector  ability  to  transmit  arboviruses.  These  researchers  are 
members  of  the  Tale  Arbovirus  Research  Unit  (TARU)  In  the  Department  of  Epidemiology 
and  Public  Health.  For  the  past  four  years  we  have  been  involved  In  cooperative 
research  with  this  group  In  an  attempt  to  combine  knowledge  of  the  population  genetic 
structure  of  Aedes  aegvptl  with  studies  of  yellow  fever  virus  vectorial  capacity.  In 


this  oontrlbutlon,  I  sunarize  the  results  ve  hsee  obtslned. 

At  this  point,  I  Bust  saphsslze  that  the  vork  reported  here  Is  the  result  of  the 
c<Biblned  efforts  of  several  workers.  Heebers  of  the  TARO  team  Include  T.H.G.  Altken, 
B.  J.  Beaty,  L.  Lorenz,  B.  R.  Miller,  and  R.  Tesh.  In  my  laboratory  in  the  Yale 
Department  of  Biology  the  main  contributors  have  been  V.  J.  Tabachnlck,  G.  P.  Wallis, 
and  G.  D.  Amato. 


POPULATIOII  OEHETICS  OP  aagrotl 

ka  Bentloned  above,  genetic  variation  in  1.  aeavoti  had  been  knovn  £or  aoae  tlae. 
The  type  of  variation  studied  had  been  aorphological,  jAiyslological,  and  behavioral. 
While  these  data  indicated  genetic  differentiation  aaong  populations,  it  is  difficult 
to  use  such  data  to  quantify  genetic  differentiation.  Therefore,  we  chose  to  use 
Isozyne  techniques  -to  study  patterns  of  genetic  variation  and  differentiation  in  this 
species.  Data  resulting  from  such  studies  have  two  especially  useful  properties: 
First,  the  data  are  quantifiable;  one  actiially  counts  gene  frequencies  and  obtains 
exact  numbers  for  the  sample.  Second,  several  genes  distributed  all  over  the  genome 
can  be  assayed.  This  latter  fact  means  that  the  results  obtained  from  such  studies 
probably  reflect  the  overall  genetic  differentiation  of  populations.  The  various 
markers  (genes)  reflect  the  differentiation  at  that  locus  and  segments  of  the 
chromosome  surrounding  it.  The  genes  themselves,  coding  for  enzymes,  may  not  be 
Important  per  se:  rather  they  are  convenient  markers  which  hopefully  reflect  the 
genome  as  a  whole.  If  so,  then  the  patterns  of  variation  discerned  for  isozymes  may 
be  reflected  in  variation  in  other  traits,  such  as  efficiency  of  disease 
transmission.  As  will  become  clear,  this  is  at  least  partly  true  for  aegvptl  and 
yellow  fever. 

We  initially  studied  about  25  enyme-coding  loci  in  aegvpti.  Of  these  10  or  11 
proved  to  be  polymorphic  and  therefore  useful  in  determining  patterns  of 
differentiation.  To  date,  we  have  analyzed  more  than  100  samples  from  about  70 
localities  throughout  the  distribution  of  the  species;  more  than  one  sample  was 
obtained  from  several  localities.  Each  of  the  100  samples  averages  about  50 
Individuals  (100  genes)  analyzed  for  variants  at  each  locus.  Thus  one  can  appreciate 
that  the  data  from  these  studies  is  rather  extensive;  presenting  table  after  table  of 
gene  frequencies  would  not  be  particularly  enlightening.  Rather  we  have  attempted  to 
use  several  statlstioal  methodologies  to  staaqrlze  the  data  for  various  purposes. 
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One  will  be  presented  here. 
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In  analyzing  genetic  variation  In  vectors  one  useful  question  vhlch  Bay  be  asked 
Is  whether  there  Is  sufficient  differentiation  to  be  able  to  assign  a  population  of 
unknown  origin  to  the  region  from  which  It  originated.  That  is,  do  populations 
occupying  different  regions  of  a  species  distribution  have  gene  frequency  differences 
of  great  enough  magnitude  to  discriminate  among  regions?  There  exists  a  statistical 
procedure  designed  to  answer  such  questions;  this  is  the  stepwise  multivariate 
discriminant  analysis.  In  order  to  use  this  procedure  one  must  first  define  the 
groups  to  be  discriminated.  Using  other  analytical  procedures,  we  defined  eight 
genetically  distinct  groupings  of  A*  aegvptl  populations.  These  are:  East  Africa 
subspecies  aegvntl  (abbreviated  EAA),  East  Africa  subspecies  formosus  (EAF),  West 
Africa  (VA),  Asia,  Southeast  United  States  (SEUS),  Texas  and  Mexico  (TEIMEX),  Central 
and  South  America  Including  Trinidad  (CSA),  and  the  Caribbean  (CAR). 

Once  groupings  are  defined,  the  discriminant  analysis  determines  the  composite 
variable  baaed  on  allele  frequencies  which  best  discriminates  the  groups.  The 
variable  Is  given  coefficients  which  maximizes  the  ratio  (F)  of  between  group 
variance  to  within  group  variance.  The  resulting  variable  is  called  the  first 
canonical  variable.  The  variance  in  gene  frequencies  correlated  to  this  first 
canonical  variable  Is  removed  from  the  data  and  a  second  composite  variable  Is 
determined  based  on  the  remaining  variance.  The  procedure  can  proceed  until  all  the 
variance  In  gene  frequencies  Is  exhausted.  Fortunately,  for  our  data  the  first  two 
canonical  variables  account  for  a  large  fraction,  up  to  90$,  of  the  total  variance. 
This  means  that  the  essence  of  the  analysis  can  be  displayed  in  a  two  dimensional 
graph. 

Figure  1  la  an  Illustration  of  such  an  analyaia  on  the  more  than  60  localities  we 
have  studied.  (If  more  than  one  sample  has  been  analysed  for  a  locality,  mean  gene 
frequencies  across  samples  were  used.)  As  oan  be  seen,  EAA  localities  are  widely 
separated  from  all  the  others,  which  are  mot  well  separated  toward  the  left  of  the 


graph.  Because  of  the  distortion  generated  by  the  Inclusion  of  EAA,  one  can  rerun 
the  analysis  oaitting  the  three  EAA  localities.  The  results  are  in  Figure  2.  Six  of 
the  remaining  seven  groups  initially  defined  are  separated.  TEZMEX  and  CSA  are 
indistinguishable.  Therefore  we  conclude  that  it  may  not  be  valid  to  consider  these 
separate  groups.  Based  on  this  analysis  then,  we  can  legitimately  define  seven 
genetically  distinct  groups  of  populations  of  ±,.  aegvpti.  In  most  cases  there  is  a 
greater  than  90}  probability  of  assigning  a  population  of  unknown  origin  to  the 
correct  group  based  on  allele  frequencies  at  these  ten  loci.  (For  more  details  of 
the  data  and  procedures,  see  Wallis  et  al. .  1983,  and  references  therein.) 

We  should  note  that  in  defining  genetically  distinct  groups  in  this  manner, 
groupings  may  change  as  more  samples  are  obtained.  Groups  may  be  merged  and  others 
erected.  However,  with  our  A.  aegvptl  data,  the  groupings  have  remained  remarkably 
stable  as  we  have  added  more  samples.  We  first  attempted  to  define  groups  and 
subject  them  to  discriminant  analysis  when  we  had  34  samples  from  28  localities 
(Powell  et  al..  1980).  At  that  time  we  defind  seven  groups,  the  same  as  before 
except  for  TEXMEX  for  which  we  had  no  samples.  When  samples  from  Texas  and  Mexico 
were  obtained,  we  noted  that  allele  frequencies  were  distinct  from  other  U.S. 
populations  and  therefore  erected  a  new  group.  Now  with  more  South  American  seunples, 
it  appears  that  perhaps  this  TEXMEX  should  be  merged  with  CSA.  Except  for  this 
change,  nearly  tripling  the  number  of  samples  has  not  disrupted  the  groupings.  This 
Increases  our  confidence  that  the  groups  are  real  and  not  artifacts  of  the  analytical 
procedure. 
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TELLOH  FETEB  COMPETENCY 

Twenty- four  of  the  population  aaaplea  have  also  been  studied  for  their  ooapetency 
to  be  Infected  with  yellow  fever  virus  (Tabachnick,  et  al..  In  preparation). 
Mosquitoes  were  fed  on  a  suspension  of  virus,  and  allowed  to  Incubate  for  14  days  at 
26**  C.  Heads  were  then  squashed  and  the  presences  of  virus  detected  by  fluorescent 
antibodies.  (Details  will  be  presented  elsewhere).  Table  1  shows  the  results.  The 
rate  of  Infection  varied  from  7  to  57%,  Table  2  Is  a  statistical  analysis  of  these 
data.  In  this  Table  we  have  Included  populations  which  had  been  in  laboratory 
culture  for  three  generations  or  less.  The  reason  Is  that  we  have  evidence  that 
under  culture,  strains  change  genetically. 

While  the  correlations  are  far  from  perfect,  the  genetic  groupings  we  defined  and 
the  rates  of  Infectlvlty  are  related.  The  groups  CAR,  CSA,  EAF,  tend  to  be  high  In 
Infectlvlty  while  WAF  and  TEIMEX  are  low.  ASIA  falls  Intermediate.  SEUS  Is  quite 
heterogeneous,  having  populations  both  relatively  high  and  relatively  low. 

This  nonrandom  listing  of  populations  In  Table  2  argues  for  a  genetic  component 
in  determining  Infectlvlty.  We  would  not  argue  that  the  Isozymes  themselves  are 
Involved  In  Infectlvlty.  However  they  seem  to  be  marking  areas  of  the  genome  which 
are  Important. 

I  should  also  mention  that  we  have  done  selection  experiments  trying  to  derive 
populations  higher  and  lower  in  susceptibility  to  yellow  fever  infection.  These 
attempts  have  been  partially  successful.  However,  such  studies,  like  those  presented 
In  Tables  2  and  3,  do  give  indications  of  genetically  determined  variation  for  this 
trait  which  Is  probably  quite  complex.  That  Is,  many  genes  each  with  small  effects 
are  likely  to  be  Involved.  This  makes  more  so^lsticated  genetic  analysis  very 
difficult. 

It  Is  interesting  to  note  that  of  all  the  genetic  studies  on  arbovirus  competency 
In  mosquitoes  (revleved  In  Oubler,  et  al..  19^)  none  have  found  simple  genetic 


control.  Thl8  is  In  contrast  to  studies  on  other  parasitic  diseases  such  as  aalarla 
and  fllariasis  for  which  single  aendelian  factors  have  been  iapllcated  In  determining 


vectorial  competency. 
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TABLE  1 

Rates  of  InfectlTlty  with  yr''low  foTor  Tima  for  populations  of  Aadas  aagvptl. 


GENERATION  NUMBER  INFECTED 

GROUP  POPULATION  TESTED  - 

NUMBER  TESTED 


%  INFECTED 


WAF 

Dakar,  Senegal 

F2 

N'goye,  Senegal 

F3 

Gambia 

F6 

EAF 

Kampala,  Uganda 

FI  4 

Kombeni,  Kenya 

F8 

EAA 

Kwa  Dzlvo,  Kenya 

F3 

Majengo,  Kenya 

F3 

ASIA 

Amphur ,  Thai land 

F29 

Bangalore,  India 

F3 

Fiji 

FI 

TEXMEX 

Austin,  Texas 

F3 

Galveston,  Texas 

FI 

Houston,  Texas 

FI 

Weslaco,  Texas 

FI 

Victoria,  Mexico 

F3- 

SEUS 

Abbeville,  Louisiana 

F1 

Beaumont,  Texas 

FI 

Vero  Beach,  Florida 

FO 

CSA 

Esculntla,  Guatemala 

FI 

Malaga,  Colombia 

FO 

Santa  Cruz,  Bolivia 

F4 

Trinidad 

FI, 

CAR 

Montserrat 

FO 

Puerto  Rico 

F3 

10U/184 


TABLE  2 


Populations  In  Table  2  In  deoraaalng  order  of  susceptibility  to  oral- Infection 
vitb  yellow  fever  virus.  Vertical  lines  connect  populations  showing  no  significant 
(p  0.05)  difference  in  infectivlty  by  a  chi-squared  test. 


POPULATION 

GROUP 

Kva  Dalvo 

BAA 

Montserrat 

CAR 

Malaga 

CSA 

Vero  Beach 

SEUS 

Trinidad 

CSA 

Puerto  Rico 

CAR 

Santa  Cruz 

CSA 

Esculntla 

CSA 

Majengo 

BAA 

Austin 

TEXMEX 

Beaumont 

TEXMEX 

Bangalore 

ASIA 

Fiji 

ASIA 

Houston 

TEXMEX 

Victoria 

TEXMEX 

Galveston 

TEXMEX 

Weslaco 

TEXMEX 

Abbeville 

SEUS 

Dakar 

WAP 

Ngoye 

WAF 
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PIOURE  1 

Canonical  plot  of  Aadea  aagvotl  populationa  baaed  on  10  polyaorphlc  tsozyae  loci. 
Each  population  la  repreaented  by  the  Initial  letter  of  Ita  group,  except  z  for  EEA; 
overlap  of  populationa  la  repreaented  by  *.  Nunbera  are  the  mean  poaltlon  for  the 
group. 


FIGURE  2 

Same  aa  Figure  1  omitting  EEA  populations. 


\r " 


Canonical  variable  I 
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